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ABSTRACT 


In  recent  years,  the  study  of  systems  with  multiple  sensors  has  been  an  active  area 
of  research.  In  this  paper,  we  focus  on  the  computational,  i.e.,  architectural,  algorithmic, 
and  synchronization  issues  related  to  competitive  information  integration  in  a 
Distributed  Sensor  Network  (DSN).  The  proposed  architecture  of  the  DSN  consists  of  a 
set  of  binary  trees  whose  roots  are  fully  connected.  Each  node  of  the  tree  has  a  processing 
element  and  one  or  more  sensors  associated  with  it.  The  information  from  each  of  the 
sensors  has  to  be  integrated  in  such  a  manner  that  the  communication  costs  are  low  and 
that  the  real  time  needs  are  met.  We  present  an  information  integration  algorithm  which 
has  a  low  message  cost  (linear  in  the  number  of  nodes  of  the  network  )  and  a  low 
distributed  computation  cost. 

In  a  distributed  environment  there  is  no  central  clock  which  regulates  the  activitiesi.^ 
of  each  node.  Further,  the  clock  at  each  node  is  typically  not  accurate.  The  estimates 
from  each  of  the  sensors  which  need  to  be  integrated  have  to  be  temporally  "close  to 
each  other",  however.  We  consider  the  problems  associated  with  synchronizing 
information  to  be  integrated  in  the  presence  of  imperfect  clocks.  ^^We  derive  a)  the 
relationships  between  the  clocks  of  the  processing  elements  in  the  network  for  proper 
information  integration  and  b)  an  upper  bound  on  the  period  between  consecutive 
resynchronizations  of  a  processing  element’s  clock  with  the  central  time  serven'  Finally, 
we  discuss  the  fault  tolerant  features  of  the  network  and  the  integration  algorithm. 


KEYWORDS  and  PHRASES  :  Distributed  Sensor  Network,  Integration  Algorithm, 

Fault  Tolerance,  Clock  Synchronization,  Abstract  Sensor 
Estimate. 
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1.0  INTRODUCTION 

In  recent  years  the  study  of  multisensor  systems  has  been  an  active  area  of 
research  [14,  4,  2, 11, 10].  Our  interest  is  in  multiple  sensor  systems  which  consist  of  tens  or 
even  hundreds  of  sensors.  Such  a  system  is  usually  organized  as  a  distributed  sensor 
network  (DSN)  which  consists  of  a  set  of  sensors,  a  set  of  processing  elements  (PEs),  and  a 
communication  network  interconnecting  the  various  PEs.  One  or  more  sensors  is 
associated  with  each  PE.  A  need  for  DSNs  arises  in  diverse  applications  such  as 
intelligent  robotic  systems,  aircraft  navigation,  systems  which  monitor  the  activities  on  an 
industrial  assembly  line,  etc.[4,  10, 15].  A  significant  advantage  offered  by  the  integration 
of  information  from  disparate  sensors  is  that  the  reliability  and  the  fault  tolerance  of  the 
sensor  system  are  both  enhanced.  Another  advantage  is  the  suppression  of  the  effects  of 
noise.  This  is  because  the  noise  measured  by  different  sensors  tends  to  be  uncorrelated, 
while  the  signal  of  interest  remains  correlated.  Also,  by  careful  selection  of  disparate 
sensors,  one  can  compensate  for  the  shortcomings  and  peculiarities  of  particular  types  of 
sensors. 

The  method  of  integrating  information  from  the  various  sensor  outputs,  called 
information  integration,  depends  on  whether  the  sensors  provide  a)  competitive 
information  or  b)  complementary  information  [4,  2].  In  the  former  case,  each  sensor 
ideally  provides  identical  information.  In  reality,  however,  the  information  provided  by 
each  sensor  may  vary  widely  because  of  a  variety  of  reasons  such  as  the  noise  at  a 
particular  sensor  site,  unreliable  transmission  due  to  channel  failures,  loss  of  accuracy  in 
translating  information  from  disparate  sources  into  a  common  information  medium.  It  is 
therefore  necessary  for  the  information  from  the  sensors  to  be  combined  in  a  meaningful 
and  efficient  manner  to  obtain  a  fairly  accurate  result.  Complementary  information 
integration  is  done  when  only  partial  information  is  available  from  each  sensor;  such 
information  is  then  integrated  to  build  the  complete  information.  Note  that  in  either 
case,  we  need  an  abstract  model  of  the  sensor  to  compare  or  combine  the  information 
from  disparate  sensor  sources. 

1.1  SCOPE  OF  THIS  PAPER 
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From  the  foregoing  discussion,  it  is  dear  that  the  integration  of  multiple  disparate 
sensors  into  an  effective  sensor  network  requires  the  solution  of  several  problems.  These 
problems  could  be  broadly  categorized  into  two  categories: 

(a)  Those  related  to  sensor  modeling  and 

(b)  Those  related  to  sensor  information  integration. 

Examples  in  the  first  category  include  the  problems  associated  with  the 
spedfication  of  appropriate  sensor  parameters  [4]  or  the  probabilistic  model  of  individual 
sensor  performance  [1].  Examples  in  the  second  category  include  methods  needed  to 
abstractly  represent  the  information  gained  from  sensors  so  that  this  information  may 
easily  be  integrated  and  also  methods  to  deal  with  possible  differences  in  points  of  view 
on  frames  of  references  among  multiple  sensors  [1,  8].  For  an  excellent  discussion  of  the 
problems  and  current  state  of  the  art  in  multisensor  integration,  the  reader  is  referred  to 
the  survey  paper  by  Luo  and  Kay  [10]. 

In  this  paper  we  focus  on  the  architectural,  algorithmic,  and  the  clock 
synchronization  issues  related  to  competitive  information  integration  needed  in  the 
distributed  sensor  network.  Our  paper  discusses  the  situation  when  the  sensor  outputs 
are  connected  intervals  on  the  real  line  R.  Our  interval  estimates  are  tolerance  zones 
containing  sensors  associated  with  the  PEs  of  the  distributed  sensor  network  and  our 
technique  of  integration  focuses  on  obtaining  reliable  regions  for  correct  sensor  values 
with  fault  tolerance.  These  issues  have  not  received  much  attention  in  the  context  of 
multisensor  systems.  The  architectural  issues  are  discussed  to  a  certain  extent  by  [5, 15]. 

1.2  ORGANIZATION  OF  THE  PAPER 


The  paper  is  organized  as  follows:  In  section  2  we  give  the  architecture  of  the  DSN 
under  consideration.  Our  abstract  sensor  model,  detailed  in  section  3,  is  an  extension  of 
the  model  developed  by  Marzullo[12].  In  section  4,  we  motivate  the  need  for  clock 
synchronization  and  present  the  information  integration  algorithm.  In  section  5,  we 
present  certain  fault-tolerant  features  of  the  network  and  the  integration  algorithm. 
Section  6  contains  some  concluding  remarks  and  the  future  directions  this  research 
could  take.  The  Appendix  presents  a  computational  characterization  of  the  integration 
algorithm  using  Heaviside  functions. 
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2.0  ARCHITECTURE  OF  THE  DSN 


Two  types  of  architectures  for  distributed  networks  have  been  discussed  by 
Wesson,  et.  al  [13],  namely  the  committee  (or  anarchic)  organization  and  the  hierarchical 
organization.  In  a  committee  organization,  each  node  (i.e.,  the  PE  and  its  associated 
sensors)  in  the  network  is  autonomous  and  can  broadcast  information  to  all  other  nodes 
in  the  network.  In  a  hierarchical  organization,  the  nodes  are  assembled  as  strict 
hierarchies  of  abstraction  levels.  At  each  level,  nodes  receive  information  from  the  lower 
level  nodes,  integrate  the  information  received  according  to  their  position  in  the 
hierarchy,  and  send  abstracted  reports  to  nodes  at  higher  levels  in  the  hierarchy.  The 
node  at  the  highest  level,  called  the  commander,  makes  appropriate  decisions  based  on 
the  received  information,  globally  interprets  the  required  hypotheses,  and  controls  the 
netw'ork  appropriately. 

The  committee  organization,  which  requires  ©(N2)  interconnections  in  an  N  node 
DSN,  would  be  expensive  and  practically  infeasible  when  one  considers  a  large  number 
of  nodes.  The  hierarchical  organization,  on  the  other  hand,  does  not  suffer  from  this 
disadvantage  (it  requires  0(N)  interconnections  in  an  N  node  DSN),  but  could  produce 
inaccurate  estimates  since  data  sharing  is  not  allowed  between  the  low  level  sensors  and 
since  errors  accumulate  as  one  goes  up  the  hierarchy. 

In  this  paper  we  consider  a  DSN  consisting  of  several  clusters.  Each  cluster  is 
organized  in  a  hierar  chical  marmer  as  a  binary  tree  of  nodes.  Each  node  has  a  PE  and  an 
associated  sensor.  The  leaf  nodes  are  at  the  first  level  of  the  network.  The  parent  of  a 
node  is  at  one  level  higher  than  its  children.  A  clock  runs  on  each  PE.  There  is  a  message 
buffer  associated  with  each  channel  as  shown  in  Figure  2.3.  After  the  information  is  read 
by  a  sensor,  the  associated  PE  translates  that  information  into  an  abstract  sensor 
estimate  (more  detailed  description  on  abstract  sensor  estimate  is  given  in  the  next 
section),  time  stamps  the  estimate  with  the  current  time  and  places  the  abstract  estimate 
in  the  buffer  (see  Figure  2.3).  The  root  of  each  cluster,  called  the  commander,  forms  a 
complete  interconnection  with  all  its  peer  nodes.  Such  an  architecture  illustrated  in 
Figures  2.1,  2.2  and  2.3  could  be  thought  of  as  a  hierarchical  system  with  a  committee 
organization  at  the  top  level.  Each  PE  communicates  to  its  children  or  its  parent  over 
channels  through  naessages. 
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In  addition  to  the  issues  already  mentioned,  we  are  also  interested  in  the  fault 
tolerance  of  the  system,  i.e.,  the  individual  sensors  could  be  faulty  but  the  ensemble  of 
sensors  should  possess  reasonable  fault  tolerant  properties.  We  address  the  issue  of 
fault  tolerance  briefly  in  Section  5.  Fault  tolerance,  however,  is  not  the  primary  focus  of 
the  paper. 

Some  of  the  important  issues  which  need  to  be  examined  are: 

(i)  How  should  the  observations  available  at  a  sensor  or  a  sensor  cluster  be 
processed  so  as  to  recover  the  relevant  signal  parameters  ? 

(ii)  What  are  the  message  routing  strategies  for  the  network? 

(iii)  How  does  internodal  coupling  affect  obtaining  of  unbiased  estimates  of  a 
parameter? 

(iv)  How  do  we  structure  the  signal  processing  cluster  units? 

The  first  issue  is  discussed  in  detail  in  the  next  two  sections.  The  second  issue, 
message  routing  involves  considering  the  time  delays  in  message  passing  in  distributed 
sensor  networks.  In  Section  4,  we  present  a  clock  synchronization  scheme  which 
integrates  information  considering  time  delays.  The  other  issues  of  importance  related  to 
message  passing,  which,  however,  are  not  examined  in  detail  in  this  paper  are:  a)  fail-safe 
message  passing  strategies  and  b)  message  routing  strategies  when  there  exist  multiple 
paths  between  two  nodes  in  the  DSN.  To  illustrate  a  problem  relating  to  the  third  issue, 
consider  Figure  2.4.  The  estimate  that  P  receives  from  L  could  itself  be  a  function  of  P's 
estimate  since,  in  addition  to  the  path  from  L  to  P,  there  exists  a  path  from  P  to  L.  As  a 
result,  the  combined  estimate  calculated  at  P  could  be  biased  estimate  favoring  P's 
estimate  since  the  latter  has  been  included  twice  in  the  calculation  for  the  combined 
estimate. 
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Figure  2.1  Overview  of  DSN  Architecture 


Figure  2.2  Sensor  Cluster  Unit 
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Details  of  a 
Commander  Node 
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Figure  2.4  L's  integrated  sensor  value  that  P  obtains  could  itself  be  a 
function  of  P's  value  since  there  exists  a  path  from  P  to  L. 

We  focus  mainly  on  the  first  two  issues  in  this  paper.  The  other  issues  do  not 
concern  us  since  we  restrict  our  discussion  to  a  distributed  sensor  network  with  a 
complete  binary  tree  of  nodes  at  each  cluster  unit  and  with  unidirectional  information 
flow  from  nodes  at  the  lower  level  to  the  nodes  at  the  higher  level. 


3.0  THE  ABSTRACT  SENSOR  MODEL 

A  physical  sensor  is  a  device  which  samples  a  physical  variable.  For  example,  a  PE 
controlling  a  surveillance  satellite  antenna  might  have  an  infra-red  sensor  as  a  physical 
sensor.  The  PE  may  obtain  measurements  either  by  polling  it  lor  its  current  value  or  by 
being  asvTichronously  alerted  when  a  certain  signature  is  detected.  Assume  for  simplicity 
that  each  sensor  outputs  a  single  value  represented  by  a  real  number.  In  reality,  however, 
because  of  reasons  such  as  a)  limited  accuracy  of  the  sensor,  b)  effect  of  noise  signals,  c) 
various  delays  (the  communication  delay,  for  example)  between  the  time  ts  that  the  value 
is  read  and  the  time  tp  that  the  value  is  processed  by  the  PE,  and  d)  inaccuracy  in 
knowing  the  correct  values  of  ts  and  tp  in  distributed  systems,  it  is  not  meaningful  for 
the  sensor  value  to  be  represented  by  a  single  real  value.  (Note  that  to  consider  the  effect 
of  c)  mentioned  above,  the  extrapolation  characteristics  of  the  process  have  to  be  known). 
Hence  we  define  an  abstract  sensor  to  be  a  piecewise  continuous  function  which  maps  a 
physical  sensor  value  v  (v  e  R)  into  a  dense  interval  [a,  b]  (a,b  e  R)  that  contains  the 
phvsicai  value.  The  width  of  an  interval  is  (b-a).  Ideally,  this  interval  [a,  b]  converges  to  v 
but,  in  realitv,  the  width  is  finite.  This  abstract  sensor  model  is  based  on  the  work  of 
Marzullo  [12].  He  also  shows  that  given  a  physical  sensor,  it  may  not  be  easy  to  implement 
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the  corresponding  abstract  sensor,  as  it  requires  knowledge  of  the  characteristics  of  the 
physical  process  being  monitored.  We  assume  that  this  knowledge  is  available.  In  many 
cases  when  the  characteristics  are  only  approximately  known,  a  high  sampling  rate  will 
compensate  for  this  lack  of  knowledge.  We  use  this  model  to  construct  a  fault-tolerant 
abstract  sensor. 

The  abstract  sensors  representing  two  or  more  sensors  could  be  combined  to  form 
an  abstract  estimate.  To  keep  the  terminology  simple,  we  refer  to  the  abstract  sensor  as 
the  abstract  estimate  also.  An  abstract  estimate  could,  in  turn,  be  combined  with  one  or 
more  abstract  estimates  to  form  a  new  abstract  estimate.  A  k-interval  abstract  estimate  is 

defined  to  be  an  empty  set  or  a  set  of  up  to  k  intervals  {(11,  ul), . ,(li,  ui)}  (where  1  <  i  <  k) 

such  that  Ij  <  uj  (where  1  <  j  <  i)  and  uj  <  lj+1  (where  1  <  j  <  i-1).  Observe  that  these  intervals 
do  not  overlap.  If  any  two  did  overlap  then  they  would  be  in  one  interval.  The  width  of  a  k- 
interval  abstract  estimate  is  defined  to  be  the  value: 

k 

X(ui-li) 

i=l 

The  next  section  addresses  the  question  of  how  the  abstract  sensors  or  abstract 
estimates  are  combined  to  yield  new  abstract  estimates. 


4.0  INFORMATION  INTEGRATION  OF  ABSTRACT  ESTIMATES 


In  this  section,  we  present  a  method  to  combine  information  from  multisensor 
systems  which  we  call  iiiformation  integration.  Before  we  present  the  actual  algorithm,  we 
motivate  the  need  to  combine  the  information  from  different  sensors  and  also  show  that 
the  meaningful  combining  of  information  necessitates  proper  clock  synchronization 
which  isTion-trivial  to  achieve  in  a  distributed  environment  such  as  ours. 

4.1  MOTIVATION 

Structuring  sensor  information  is  a  prerequisite  for  the  integration  problem  in  the 
cluster-tree  network.  Suppose  a  node  A  at  an  intermediate  stage  of  the  network  receives 
abstract  estimates  di,  U])  and  {h,  U2)  from  its  two  children.  Assume  that  h  <  I2  and  that  the 
two  intervals  do  not  overlap.  Node  A  can  communicate  the  2-estimate  <(li,  U]),  (l2,U2)>  to 
its  parent  to  signify  that  the  true  value  lies  in  one  of  the  two  intervals  (which  of  the  two 
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intervals  contains  the  true  value  would  be  resolved  at  a  higher  level)  or  it  could  transmit  a 
1-estimate  di,  uo).  Each  interval  requires  the  communication  of  two  values.  Observe  that 
in  the  former  case,  at  stage  i  of  the  network,  0(2^ )  number  of  values  need  to  be 
communicated  by  a  node.  This  has  a  number  of  disadvantages:  a)  high  communication 
cost  and  b)  high  computation  cost  at  the  root  node  (commander  node  in  our  case).  Since 
these  estimates  are  usually  a  function  of  time,  real  time  needs  may  preclude  the 
computation  of  estimates  which  have  an  exponential  growth  rate  at  the  root  node.  On  the 
other  hand,  in  the  latter  case,  the  communication  requirements  are  lesser  (later  we  will 
characterize  this  measure  more  precisely)  and  the  computation  requirements  are 
distributed  among  all  the  nodes.  The  accuracy  of  the  result  could  suffer,  however.  In  the 
example  considered,  we  transmit  a  1-estimate  di,  U2)  even  though  we  know  that  the  true 
value  is  not  in  the  interval  (ui,  I2).  From  the  above  discussion  it  is  clear  that  a)  there  is  a 
trade-off  between  accuracy  and  communication  and  b)  the  width  of  an  estimate  is 
inversely  proportional  to  the  accuracy.  This  suggests  that  the  width  could  be  used  as 
measure  of  inaccuracy  assuming,  of  course,  that  the  true  value  is  to  be  found  in  one  of 
the  intervals  of  the  estimate. 

In  reality,  the  disparate  sensors  may  require  a  set  of  variables  to  represent  each 
output.  Each  of  these  variables  could  itself  be  represented  by  a  vector  of  values.  If  there 
are  m  values  representing  the  state  of  the  system  and  each  value  is  represented  by  a  k- 
interval  abstract  estimate,  then  the  state  of  the  system  (each  time  the  sensors  are 
sampled)  could  be  represented  by  a  (m  x  k)  matrix. 

From  the  above  discussion,  the  need  for  combining  information  at  the  nodes  of  the 
network  to  reduce  the  communication  and  to  distribute  the  computation  is  clear.  Any 
information  combining  algorithm  should  possess  the  following  desirable  properties: 

(i)  low  message  complexity, 

(ii)  distributed  processing  uf  information, 

(iii)  reasonable  fault  tolerance. 

(iv)  reasonable  accu-acy,  i.e.,  the  true  physical  value(s)  should  be  very  likely 
found  in  the  combined  estimate  with  the  width  of  the  estimate  being  low. 

A  variety  of  techniques  including  use  of  weighted  averages,  Kalman  filter,  multi- 
Bayesian  approach,  statistical  decision  theory,  and  fuzzy  logic  have  been  used  in 
muUisensor  integration  [10].  We  shall  present  a  simple  and  intuitively  appealing 
integration  algorithm  based  on  estimates  represented  by  intervals.  Our  method  is  based 
on  the  reasonable  assumption  that  overlapping  intervals  are  likely  to  contain  the  value(s) 
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of  interest.  This  method  could  be  considered  as  a  variation  of  the  use  of  weighted 
averages  as  applied  to  intervals.  As  we  shall  see  in  Section  4.4,  even  this  simple 
integration  algorithm  becomes  intricate  after  it  is  "clock-synchronized".  A  future  goal  of 
this  research  is  to  consider  more  sophisticated  integration  algorithms  after  clock 
synchronization  is  well  understood.  In  the  next  section,  we  discuss  the  development  of 
the  integration  algorithm.  For  explanatory  purposes,  we  consider  1-interval  abstract 
estimates. 

4.2  DEVELOPMENT  OF  THE  INTEGRATION  ALGORITHM 

The  idea  behind  the  one  interval  integration  algorithm  is  the  following:  an 
overlapped  interval  (if  it  e.xists)  is  more  likely  to  contain  the  correct  value  of  the  physical 
variable  of  interest  than  a  non-overlapped  interval.  This  idea  is  similar  to  the  N-modular 
Redundancy  (NMR)  technique  used  in  many  passive  and  hybrid  fault  tolerant  systems 
(N  =  3  in  our  case)  [7].  Since  we  use  intervals  rather  than  single  values  as  the  abstract 
sensor  estimates,  it  is  possible,  in  general,  to  have  two  or  more  distinct  overlapping 
intervals  as  the  estimate.  With  a  1-interval  estimate  and  a  binary  tree  network,  each 
node  has  to  integrate  three  abstract  estimates  (one  from  the  associated  sensor  and  one 
each  from  the  node’s  two  children).  The  four  ways  in  which  the  estimates  could  overlap 
are  shown  in  Figures  4.1(a)-(d).  The  only  case  of  distinct  overlapping  intervals  that  arises 
is  shown  in  Figure  4.1(b).  It  is  also  possible  that  no  intervals  overlap  (Figure  4.1(d)).  In  both 
these  cases,  we  choose  an  estimate  that  spans  the  intervals  of  interest.  By  doing  so,  our 
estimate  would  contain  sub-intervals  that  are  unlikely  to  contain  the  correct  value  of  the 
physical  variable  (examples  are  Sa  in  Figure  4.1(b),  Sb  and  Sc  in  Figure  4.1(d)).  From 
these  observations  it  is  also  clear  that  the  inaccuracy  of  the  estimate  is  proportional  to  the 
width  of  the  estimate.  Each  estimate,  however,  is  a  single  interval  and  is  transmitted  as 
one  message  over  the  communication  link.  When  information  is  not  integrated,  however, 
the  number  of  messages  sent  by  a  PE  at  level  i  in  the  worst  case  is  (  2  -  1).  Thua,  there  is  a 

trade-off  between  accuracy  and  communication.  The  reader  is  referred  to  [6]  for  further 
discussions  on  message  complexity.  Further,  note  that  when  the  non-root  PE  p  does  not 
perform  anv  information  integration,  the  computation  time  at  the  root  node  increases. 
The  computation  time  with  information  integration  turns  out  to  be  better  than  that 
without  information  intetrration  bv  a  factor  of  0(n)  where  n  is  the  number  of  nodes  in  the 
network  fb]. 
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Consider  two  1-interv'al  estimates  given  by  the  intervals  <li,  U|>  and  <h,  U2>.  Let 
I  ui  -  b  I  =  0.  If  ui  >  I2  then  the  integration  algorithm  chooses  the  intersecting  interval  of 
width  (u]  - 12).  Otherwise  (i.e.,  ui  <  b)  the  algorithm  chooses  the  spanning  interv'al  of  width 
(u2  -  li).  Clearly,  this  is  undesirable  since  small  errors  could  result  in  choosing  estimates 
with  large  derivations.  We  therefore  require  that  for  two  intervals  to  be  considered 
intersecting,  the  width  of  the  intersecting  interval  be  at  least  A  (this  width  is  a  design 
parameter  chosen  depending  on  the  sensor  characteristics,  the  roundoff  errors 
introduced  by  the  PE,  etc.) 

4.3  CLOCK  SYNCHRONIZATTON  IN  DSN 

Since  the  physical  sensor  outputs  typically  change  as  a  function  of  time,  it  is 
necessary  for  each  of  the  estimates  that  are  integrated  to  be  "close  to  each  other" 
temporally  in  order  for  the  integration  process  to  yield  meaningful  results.  This  is 
achieved  by  time-stamping  each  estimate.  If  the  estimates  from  different  sensors  are  to 
be  synchronized  to  obtain  a  final  estimate  at  each  of  the  commander  nodes,  then  it  is 
necessary  that  the  sensors  be  sampled  at  approximately  the  same  time. 
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12 

13 

Result 


(b)  (d) 

Figure  4.1  I1, 12,  l3  are  the  abstract  estimates.  Result  is  the 
1  -interval  abstract  sensor  computed  by  the  algorithm.  Note 
that  Sa,  Sb,  Sc  are  intervals  included  in  the  Result  even 
though  they  are  not  intersecting  intervals. 

In  a  distributed  environment  such  as  ours,  there  is  no  central  synchronized  clock  which 
regulates  the  activities  of  each  node.  Instead,  each  node  is  under  the  control  of  its  own 
clock.  Since  a  sensor  responds  to  real  time  activities,  it  is  convenient  for  the  clock  at  each 
node  to  provide  the  real,  i.e.,  physical  time.  Further,  since  the  estimates  from  different 
sensors  have  to  be  integrated,  it  is  convenient  to  have  the  time  provided  by  each  of  the 
sensor  nodes  to  be  close  to  each  other.  The  clock  at  each  node  may  not  be  accurate 
because  of  a  variety  of  reasons  such  as  clock  shift,  variations  in  temperature,  etc.  Each 
clock  therefore  has  to  periodically  synchronize  with  a  more  accurate  clock.  We  assume 
the  existence  of  a  central  time  server  on  one  PE  of  the  network  which,  when  requested  for 
the  time  at  t,  provides  the  time  C(t).  It  is  possible,  though  unlikely  when  fast  real  time 
communication  is  achievable,  that  in  the  case  of  some  DSNs  which  are  eeoeraohicallv 
spread  out  ,  there  will  he  a  need  for  many  time  servers.  Since  the  time  provided  by  two 
time  servers  may  themselves  not  agree,  clock  synchronization  with  multiple  time  servers 
is  more  complicated  than  with  a  single  time  server  (for  a  detailed  treatment,  see  [9]).  The 
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central  time  server  itself  periodically  synchronizes  with  a  universal  time  server.  The  latter 
is  always  accurate  and  lies  outside  our  environment. 

4.31  Clock  Behavior  and  Synchronization 

In  this  section  we  formalize  the  notion  of  integrating  abstract  estimates  which  are 
temporally  "close  to  each  other"  by  deriving  an  upper  bound  on  the  time  interval 
between  the  arrivals  of  two  abstract  estimates  at  a  node.  Let  Cp(t)  be  the  time  provided 
by  the  clock  on  PE  p  at  time  t.  Note  that  t  itself  is  not  observable  by  PE  p,  nor  by  any  of  the 
other  processing  elements. 

Let  us  assume  that  the  time  Cp(tl)  provided  by  PE  p  be  greater  than  C(tl),  the  time 
provided  by  the  central  time  server.  Let  p  now  synchronize  with  the  central  time  server.  It 
is  now  possible  that  the  new  time  Cp(t2)  is  less  than  Cp(tl).  An  abstract  estimate  sent  out 
later  at  Cp(t2)  seems  as  though  it  was  sent  earlier.  If  the  abstract  estimates  are  integrated 
in  a  non  First-in  First-out  fashion  at  a  node,  this  could  lead  to  problems.  Hence,  we 
require  that  the  time  on  a  PE  to  always  increase  monotonically.  This  monotonidty  could 
be  achieved  by  speeding  up  or  slowing  down  a  PE’s  clock  each  time  a  correction  is 
required  on  synchronizing  with  the  central  time  serv'er  [3]. 

From  the  foregoing  discussion,  we  can  now  state  the  following  requirements  for 
proper  synchronization  (the  subscript  p  refers  to  the  PE  p): 

Requirement  1:  Correct  Time:  The  deviation  in  time  of  each  clock  is  bounded. 


t  -  Cp(t)  1  <  Ep 

(la) 

t  -  C(t)  1  <  a 

(lb) 

Where  ep:  is  the  maximum  allowable  deviation  in  time  of  a  clock 
on  a  PE. 

a:  is  the  maximum  allowable  deviation  in  time  of  the 
clock  on  a  central  time  server. 

Requirement  2:  Correct  Rate: 

Between  resvnchronizations,  the  drift  rate  of  the  clock  is  bounded. 
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1  \<Kn 

(2a) 

dt 

(2b) 

Where  Kp:  is  the  maximum  allowable  drift  rate  in  time  of  a  clock 
on  a  PE. 

a:  is  the  maximum  allowable  drift  rate  of  the  clock  on 
the  central  time  server. 


Requirement  3: 

The  clock  on  each  of  the  PE  p  and  the  central  time  server  increase  monotonically. 

We  have  assumed  that  the  quantities  £p,Kp,a,0  are  all  fixed  and  are  known.  (These 
quantities  could  be  obtained  from  the  specifications  of  the  manufacturer's  handbook.)  If 
these  quantities  are  time  varying,  then  the  analysis  becomes  very  complicated.  For 
simplicity  we  will  assume  that  the  constants  8p  is  the  same  for  all  PEs  and  equals  £. 
Similarly  it  is  assumed  that  Kp  =  K.  From  this  simplification  and  Requirement  1,  the 
following  inequality  follows.  Let  Cq(t)  be  the  time  provided  by  the  clock  on  PE  q  at  time  t. 

Requirement  4:  Synchronization  Bound 

|Cp(t)-Cq(t)l<2£  (3) 

Let  ^min  and  ^max  be  the  minimum  and  maximum  values  of  the  delay  in  receiving 
the  message  sent  by  the  central  time  server  to  any  PE  (the  message  contains  the  time  C(t) 
at  time  t ).  Let  5min  and  5max  be  the  corresponding  values  for  a  message  sent  by  a  PE  to 
its  neighbor.  Let  y  be  the  maximum  tolerance  in  time  that  a  node  can  tolerate  between 
intervals  that  can  be  integrated.  This  value  of  y  has  to  be  derived  from  the  sensor 
characteristics  and  the  longest  path  between  the  leaf  nodes  and  the  commander  node 
(flog  n1  in  our  case  where  n  is  the  total  number  of  PEs). 

The  following  lemma  and  theorem  are  used  to  state  precisely  the  conditions  for 
combining  abstract  sensor  estimates  which  are  temporally  "close  to  each  other." 

Lemma  1:  Let  a  message  be  received  by  PE  p  at  Cp(t).  Then  this 
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message  was  sent  by  PE  p  or  its  neighbors  in  the  interval 
[Cp(t)-2e  -  5max  ,  Cp(t)  +  2e  -  6min  ] 

Proof:  The  proof  is  trivial  if  the  message  was  sent  by  PE  p's  sensor  since  Cp(t)  lies  in  the 
interval. 

Whenever  a  node  sends  a  message,  it  time  stamps  the  message  with  the  current 
time.  Because  of  the  delay  characteristics  of  the  channel  connecting  PE  p  to  its 
neighbor,  the  message  was  sent  in  the  interval 

[Cp(t)  -  5max ,  Cp(t)  -  6min  ] 

according  to  p's  clock.  Let  TS  be  the  time  stamp  on  the  message.  Then  from  (3),  it 
follows  that 

TS  G  [  Cp(t)  -  2£  -  8max  ,  Cp(t)  +  28  -  5min  ]  ■ 

The  time  stamp  may  not  belong  to  the  interval,  say,  if  the  channel  becomes  faulty. 
Definition:  Let  an  abstract  estimate  time  stamped  at  TS  be  received  by  PE  p  at  time 
Cp(t).  The  estimate  is  said  to  be  proper  if 

TS  G  [  Cp(t)  -  2£  -  5max  ,  Cp(t)  +  2£  -  5min  ]. 

Theorem  2:  Let  the  three  proper  abstract  sensor  estimates  Il,l2  and  I3  be  received  by  PE 
p  at  times 


Cp(tl)  <  Cp(t2)  <  Cp(t3)  respectively 
Then  L  (i  =  2, 3)  can  be  integrated,  iff 

(Cp(ti)  -  Cp(tl)  +  4£  +  Omax  -  5min  )  <  y  (4) 

P-rQflf:  Since  the  estimates  are  proper  estimates,  we  can  deduce,  using  Lemma  1,  that 
these  estimates  originated  in  the  interval 

[  Cp(tj)  -  2F  -  §max  ,  Cp(tj)  !  2C  -  5iuin  j  (5) 


(1 <j<3) 
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If  I2  is  to  be  integrated  with  II,  the  spanning  time  interval  of  estimates  Ii  and  I2 
should  at  most  equal  y,  the  maximum  tolerance  in  time  that  a  sensor  can  tolerate. 
The  spanning  interval  is  obtained  from  (5)  with  j  =  1,  2  to  be 

Cp(t2)  -  Cp(tl)  +  4£  +  5max  -  5min 

Similar  argument  can  be  made  for  integrating  l3  with  II.  Inequality  (4)  then  follows. 


4.32  Clock  Resynchronization 

Since  the  clocks  on  the  central  time  server  and  each  of  the  PE  s  drift,  they  have  to 
be  periodically  reset.  In  this  section,  we  derive  an  upper  bound  on  the  time  period 
between  resynchronizations. 

Let  Ts  be  the  time  period  between  resynchronization  of  the  central  time 
server. 


Let  Tc  be  the  time  period  between  resynchronization  of  the  clock  on  a  PE  p. 


The  central  time  serv’er  resynchronizes  itself  every  Ts  seconds  with  a  perfect 
universal  time  server  which  exists  outside  the  environment  of  our  DSN.  The  central  time 
server  also  resynchronizes  the  clock  on  a  PE  every  Tc  seconds. 


Let  Tg  and  T^.  be  the  periods  corresponding  to  Ts  and  Tc  as  observed  by  the  central 


time  server. 

Lemma  3:  The  period,  as  observed  by  the  central  time  server,  between 
synchronizations  of  the  central  time  server  is  bounded  by 


T‘  < 
‘s  - 


y1. 

I 


1  ^ 


Proof:  From  (lb)  and  (2b),  we  can  derive 


a  Ts<  a 
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From  (2b),  Ts  is  in  the  interval 


Li  +  a  '  1  -oJ 


Hence 


i.e.. 


.1 


■^s 

1-ay 


<  a 


xUari-l 


Observation:  When  a  PE  p  is  resynchronized  by  the  central  time  server  at  time  t,  there  is 
a  transmission  delay  of  at  least  ^min  before  the  value  C(t)  reaches  p.  Hence 

Cp(t')  >  C(t)  +  ^min 

Because  of  the  drift  in  the  central  time  serv^er  and  the  variable  transmission  delay 
Cp(t')  lies  in  the  interval  [Cp(t)  -  OTs  +  ^min,  Cp(t)  +  OTs  +  qmax  ].  Hence,  there  could  be  an 
error  of  (2aTs  +  ^max  -  ^min  )  at  the  time  that  p's  clock  is  resynchronized.  From  (la)  and 
(2a), 


2aTs  +  (^max  -  ^min  )  +  K  Tc  <  £  (6) 

A  restatement  of  (6)  is  Theorem  4. 

Theorem  4:  The  time  period  between  resynchronizations  of  the  clock  on  a  PE  is  bounded 
by 

£  -  2aTs  -  (^max  -  <^min  ) 

Tc  < -  (7) 

K 


Proof:  Arguments  in  the  previous  paragraph.  ■ 

Observation:  A  restatement  of  (7)  with  observable  times  on  the  central  time  server  is 


20Ts 

£-“  —  (Cmax  -  Cmin 

1-a 


K 


-  a 


(8) 


Using  the  clock  and  network  parameters,  we  can  rewrite  the  equation  (8)  as 
follows: 


i  8  -  2a-  (^max  -  ^min  ) 

Tf,< -  -a  (9) 

C  V- 


4.4  A  CLOCK  SYNCHRONIZED  INFORMATION  INTEGRATION 
ALGORITHM 

This  section  presents  a  clock  synchronized  integration  algorithm  using  the  ideas 
presented  in  the  previous  two  sections.  At  each  node  the  integrated  information  is  a  1- 
interval  abstract  sensor  Ip  computed  from  the  three  abstract  estimates  II,  I2, 13  received  at 
each  node  Thus 


Ip  =  D  ( II,  I2, 13) 

The  computation  of  the  function  D  is  the  information  integration  algorithm.  As  we 
have  mentioned  earlier,  we  consider  only  a  1-interval  information  integration  algorithm. 
Further,  we  assume  that  the  state  of  the  system  at  any  instant  can  be  represented  by  a 
single  value.  (The  algorithm  can  be  extended  in  a  straightforward  manner  for  the  case  of 
a  vector  of  values  (see  the  Appendix)  with  each  value  being  represented  by  a  k-interval 
estimate.)The  algorithm,  called  One_Interval  _Integrate  and  shown  in  pseudo-Pascal  on 
the  next  page,  computes  the  1-interval  abstract  estimate  at  a  node  p  and  sends  it  to  its 
parent  tirno  starnping  it  with  the  current  time  CT.  Procedure  FORM_n_I2_I3,  which  is 
repeatedly  called  as  a  procedure  by  One_lmerval_Tntegrate,  integrates  the  1-interval 
abstract  estimates  that  are  temporally  close  to  each  other  using  lemma  1  and  theorem  2. 
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Algorithm  On8_lnterval_lntegrate; 

/*  A  The  minimum  width  of  overlap  for  two  intervals  considered  intersecting.  •/ 

/■*  CT  Current  time  as  seen  by  a  processing  element  clock.  '/ 
r  Ip  Integrated  abstract  sensor  estimate.  */ 

1.  Begin  FORMJ1J2J3 

2.  if  (width  (  Iinl2  nis)  *  A)  then  /•  See  Fig.  4.1(a)  •/ 

3.  Ii  «-  lowest  value  in  the  intersecting  interval; 

4.  hi  «-  highest  value  in  the  intersecting  interval; 

5.  else  if  (width  (limlj)^)and(width  (ljnlk)>A)then  (i  j.  j  ^  k,  i  ^  k,  1  <  i,  j,  k  <  3) 

/•See  Fig.4.1(b)*/ 

6 .  h  <-  lowest  value  in  the  left  intersecting  interval; 

7.  hi «-  highest  value  in  the  right  intersecting  interval, 

3.  else  if  (width  (hnli)^)  then  (i  ^  j,  1  <  i.  j  <  3)  /*  see  Fig.  4.1  (c)  */ 

9.  Ii  <—  lowest  value  in  the  intersecting  interval; 

10.  hi «—  highest  value  in  the  intersecting  interval; 

11.  else  r  no  intersecting  intervals  -  see  fig.  4.1(d)  */ 

12.  Ii  ♦-  lowest  value  in  the  spanning  interval; 

13.  hi  f-  highest  value  in  the  spanning  interval; 

14.  fl 

15.  fi 

16.  fl 

17.  lof_r|.,hi]; 

13.  if  p  IS  not  the  root  then 

19.  send(  parentfp),  Is.  CT); 

20.  fi 

21.  end. 
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Procedure  FORM_l1_l2_t3 

/*  Computes  the  K-interval  abstract  sensors  associated  with  a  PE's  abstract  sensors  and  the  PE’s 
children*/ 

/*  But  is  a  FIFO  buffer  into  which  messages  of  the  form  <  abstract  sensor,  time  stamp  >  are  sent  by  the 
abstract  sensors  and  received  by  the  PE  p  */ 
r  empty  (buf)  is  true  when  there  is  no  message  in  the  buffer  */ 

/'  The  variable  CT  in  each  PE  has  the  current  time  */ 


1.  begin 

2.  done  :=  false; 

3.  while  not  done  do 

4.  receive(l1.  t): 

5.  If  t  in  ^CT  -  2e  -  5max  ,  CT  +  2£  -  5min  j  then  r  from  lemma  1  */ 

6.  done  :=  true; 

r  i.e.  II  is  a  proper  K-interval  abstract  sensor  */ 

7.  first_sensor_tima  :=  CT; 

r  store  the  time  at  which  first  proper  K-interval  abstract  sensor  read  */ 

8.  fl 

9.  od 

10.  12.  l3<-0; 

11.  if  p  is  the  leaf  PE  then 

12.  goto  L3;  r  exit  a  leaf  PE  that  does  not  have  any  children  */ 

13.  fl 

14.  LI :  empty _2&3  :=  false; 

15.  while  empty(buf)  do 

/'  if  current  time  exceeds  the  max  transmission  delay  d  then  set  12.  13  d  */ 


16. 

17. 

18. 

19. 

20. 
21. 

22. 


23. 

24. 

25. 

26. 

27. 

28. 

29. 

30. 

31. 

32. 

33. 

34. 


35. 

36. 

37. 

38. 

39. 

40. 


if  ((  CT-  first_sensor)  -f  4e  +  5max  -  6min  )  >  ythen  r  from  theorem  2  '/ 

12  < —  0  ;  13  < —  d  ; 
empty _2&3  ;« true; 
fl 

if  not  (  empty_2&3  )  then  /*  a  K-interval  abstract  sensor  was  read  */ 
receive{l2,  t); 

if  not  ( t  in  ^CT  -  2e  -  5max  ,  CT  -*•  2e  -  6min  j)  then  r  from  lemma  1  */ 

r  i.e..  this  abstract  sensor  is  not  a  proper  abstract  sensor "/ 
go  to  LI ; 
fl 

L2:  empty _3  :=  false; 

/*  the  2nd  proper  abstract  sensor  has  been  read.  Wait  for  the  third  one  '/ 
while  empty  (  buf )  do 

if  ((  CT-  first_s9nsor )  +  4e  +  5max  -  6min  )  >  ythen  r  from  theorem  2  */ 
13  <-0; 

empty _3  :=  true; 

fl 

od 

if  not  (  empty_3)  then  /*  third  K-interval  abstract  sensor  was  read  "/ 
receive!  13,  t  ); 

If  not  { t  in  ^CT  -  2£  -  5max  .  CT  +  2z  -  5min  j)  then  /'from  lemma  1'/ 

/'  i.e.,  this  abstract  sensor  is  not  a  prooer  abstract  sensor "/ 
go  to  L2; 

fl 

go  to  L3; 
fi 
fl 


od 


41.  end  procedure. 


Observe  that  in  our  integration  algorithm,  the  abstract  sensor  estimate  associated 
with  a  PE  at  level  i  could  affect  the  information  integration  in  the  same  manner  as  the 
aggregate  information  produced  by  the  children  of  the  node  rooted  at  level  i.  This 
method  of  integration  could  be  altered  by  assigning  weights  to  the  intervals  such  that  the 
individual  sensor's  abstract  estimate  is  not  as  important  as  the  abstract  estimate 
produced  bv  the  aggregate.  A  sensor's  abstract  estimate  could  be  weighted,  for  example, 
in  inverse  proportion  to  its  level  in  the  hierarchy. 

In  the  Appendix,  we  present  a  computational  characterization  of  the  combination 
function  for  1-interval  abstract  sensor  estimate  and  for  k-interval  estimates  using 
Heaviside  functions. 


5.0  FAULT  TOLERANT  ISSUES 

Assuming  that  the  PEs  in  the  network  are  reliable,  the  main  sources  of  faults  in  our 
system  are  the  sensors,  the  channels  connecting  the  PEs,  and  the  various  clocks  including 
the  central  time  server.  We  look  at  each  of  the  faults  and  their  effects. 

Sensor  Faults:  An  abstract  estimate  is  faulty  if  the  associated  sensor  produces  a  faulty 
output.  Observe  that  if  c  (  c  >  2,  c  =  2  or  3  in  our  case  )  abstract  estimates  are  non-faulty, 
then  the  intervals  represented  by  the  estimates  must  intersect.  On  the  other  hand,  if  two 
abstract  estimates  are  faulty,  then  it  is  quite  unlikely  that  the  faulty  estimates  overlap 
because  of  the  unpredictable  nature  of  the  associated  sensor  faults.  Consequently,  with 
the  presence  of  even  one  non-faulty  abstract  estimate,  the  1 -interval  abstract  estimate  is 
highly  likely  to  contain  the  correct  abstract  estimate.  Of  course,  with  the  presence  of  fvv'o 
faultv  abstract  estimates,  the  inaccuracy  of  the  1-interval  abstract  estimate  increases 
since  the  interval  of  the  estimate  is  wider.  This  reasoning  carries  over  to  the  higher  level 
nodes  where  the  1 -interval  abstract  estimates  are  also  integrated.  Our  network,  hence, 
tolerates  faults  well  and  degrades  gracefully  as  the  number  of  faults  increase.  Further, 
the  effect  of  a  fault  is  not  propagated  but  localized  to  that  node  and  all  its  children.  The 
network  is  thus  resilient  to  failures.  In  the  unlikely  case  of  all  the  faulfv'  abstract  estimates 
having  overlapping  intervals,  the  number  of  faulty  sensors  that  the  network  can  tolerate 
is  approximatelv  n/8  (  n  is  the  number  of  nodes  in  each  cluster;  for  a  derivation  of  this 
result  see  [6]). 


Observe  that  from  a  computational  point  of  view,  noise  signals  which  significantly 
alter  the  output  of  a  physical  sensor  and  faulty  sensors  are  indistinguishable-  both  result 
in  a  faultv  abstract  estimate.  The  arguments  which  we  made  to  justify  the  fault  tolerance 
of  our  scheme  would  now  hold  in  a  situation  where  there  is  a  low  signal  to  noise  ratio  and 
the  noise  signals  picked  up  by  the  various  sensors  are  uncorrelated. 

Channel  Faults:  Observe  that  if  the  channel  fails  or  is  very  slow,  i.e.,  when  the 
transmission  time  exceeds  5max  /  ^  proper  abstract  estimate  will  not  be  received  by  the 
PE.  The  information  integration  algorithm  essentially  ignores  the  message  passing 
through  the  channel  but  continues  with  the  integration  of  the  other  estimates.  Again,  the 
DSN  does  not  fail  but  degrades  gracefully. 

Clock  Faults:  Even  through  the  clocks  are  imperfect,  the  integration  algorithm  works  as 
long  as  the  requirements  specified  in  section  4.31  are  met.  When  a  clock  on  a  PE  fails, 
however,  it  would  then  transmit  only  the  abstract  estimate  associated  with  its  sensor  (this 
is  because  the  PE  has  no  way  of  knowing  that  its  clock  has  failed).  Since  the  estimate  is 
timestamped,  it  would  not  constitute  a  proper  estimate  to  its  parent  node  (unless  the 
parent  node  has  also  failed  in  an  identical  manner  which  is  highly  unlikely!).  Hence,  the 
effect  of  a  clock  failure  is  linaited  to  the  node  and  the  nodes  rooted  at  that  node.  Further, 
since  every  clock  is  periodically  resynchronized,  the  effect  of  a  clock  failure  lasts  only 
between  resynchronizations. 

In  our  DSN,  we  assumed  the  existence  of  a  single  central  time  server.  The  failure 
of  the  central  time  server  is  problematic,  however.  One  solution  is  to  have  a  number  of 
central  time  servers.  These  are  several  intricate  problems  that  need  to  be  solved  in  using 
many  central  time  servers.  For  a  discussion,  the  reader  is  referred  to  [9]. 

Observe  that  in  our  DSN,  the  effect  of  the  failure  of  a  node  or  channel  at  the 
higher  levels  of  the  network  is  more  drastic  than  a  failure  at  the  lower  levels.  It  would  be 
worthwhile  to  investigate  DSNs  in  which  these  failures  have  the  same  effect  irrespective 
of  their  position  in  the  network. 


6.0  CONCLUSION 


The  effective  use  of  multisensor  systems  requires  the  solution  of  various  problems 
relating  to  sensor  models,  the  architecture  of  the  sensor  network,  the  integration  of 
information  at  each  node  of  the  network,  the  cost  of  information  transmission,  and  the 
fault  tolerance  of  the  network.  The  integration  of  information  in  real  time  requires  the 
clocks  at  each  of  the  nodes  be  synchronized.  Synchronization  of  clocks  is  a  non-trivial  task 
in  such  distributed  sensor  networks.  In  this  paper  we  have  proposed  an  architecture 
based  on  binary  trees  for  the  sensor  network  and  considered  a)  how  information  could  be 
integrated  in  real  time  when  clocks  at  the  nodes  are  not  perfect;  b)  how  information  could 
be  efficiently  transmitted  without  incurring  heavy  communication  costs;  c)  how  the 
network  is  tolerant  to  certain  types  of  faults.  As  far  as  we  are  aware,  this  is  the  first  study  in 
the  area  of  distributed  sensor  networks  that  looks  at  the  various  computational  issues  in 
such  networks.  Since  our  focus  has  been  on  the  issues  just  mentioned,  we  have  chosen  to 
represent  information  with  a  simple  and  elegant  model  based  on  real  valued  interval  due 
to  Marzullo  [12].  We  are  aware  that  sensor  modeling  is  itself  a  detailed  area  of  study  [2, 
10].  We  have  assumed  that  the  output  of  each  sensor  is  a  physical  value.  The  above 
discussion  and  results  easily  extend  to  the  case  when  the  output  of  a  sensor  is  a  vector 
rather  than  a  single  value. 

This  study  could  be  extended  in  several  directions.  A  straightforward  extension,  as 
suggested  at  the  end  of  Section  4,  is  to  assign  weights  to  the  abstract  estimates  produced 
as  a  function  of  its  level  in  the  hierarchy  A  future  goal  of  our  project  is  to  investigate  more 
sophisticated  integration  algorithms  and  also  other  sensor  models.  Depending  upon  the 
application  that  a  DSN  gets  used  for,  other  DSN  architectures  could  be  studied.  An 
architectural  studv  could  also  be  based  on  the  fault  tolerant  properties  desired.  Finally, 
this  studv  can  be  extended  to  the  integration  of  complementary  information. 
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APPENDIX 

COMPUTATIONAL  CHARACTERIZATION 


A,1  l-Inten^al  Abstract  Sensor  Estimate 
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W e  define  a  useful  and  familiar  function  which  will  form  the  germ  of  the  algebra 
of  information  integration,  to  characterize  the  output  abstract  sensor  estimate  Ip  which 
depends  on  the  input  abstract  sensor  estimates  II,  I2, 13.  We  first  consider  the  case  of  1- 
incerval  abstract  estimates  and  then  go  on  to  generalize  the  idea  to  k-interval  abstract 
estimates. 


10  V  X  <  a 
DefweH.(x)^i  Vx>a 


H.  M 


1 

0 

a 

Figure  A.l  The  Heaviside  function. 

Ha(x)  is  called  the  Heaviside  function  (Figure  A.l).Using  this  function  we  can 
construct  a  function  which  takes  the  value  1  over  a  desired  interval  say  [a,b]  and  zero 
everywhere  else  as  follows  : 

Set 

Hia,hi  (X)  =  Ha(x)  {  1  -  Hb(x) ) 


This  clearly  is  the  required  function  (Figure  A.l). 


Ha  (h) 


Indeed  ; 


Hla,  bl  (x)  = 


[l  a  <  X  <  b 
lo  |Vx>b 


The  set  of  values  for  which  a  function  is  non-zero  is  called  the  support  Supp  of  t’  c 
function. 


Thus,  Supp  [H[a,  bl  (x)  ]  =  [a,  b). 
It  is  easy  to  see  that 


1  H[a,b]  (x)  dx  =  b  -  a. 


This  gives  the  width  of  the  interval  [a,b). 

Let  Ii  =  [a],bl],  I2  =  [a2,b2],  I3  =[a3,b3]  be  the  1-interval  abstract  sensor  estimates 
entering  the  processor  p.  We  wish  to  integrate  the  above  inputs  into  an  output  which  is  a 
reliable  index  of  the  sensor  values.  To  achieve  this  we  follow  the  prescription  of  the 
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integration  algorithm  and  obtain  a  functional  realization  using  the  Heaviside  function 
and  other  auxiliary  functions. 

j 


Define  Hj.(x)^H^.(x)(l-H^.(x))  V  l<j<3. 

-2 

Thus  Hjj  (x )  has  support  Ij,  i.e.,  Supp  [H  ]  =  Ij  =  H  jj  (1) 


We  consider  three  possibilities  for  the  1-interval  sensor  inputs  with  regard  to  their 
relative  positions  inducing  overlap  or  reinforcement  of  sensor  information  and  integrate 
in  each  case  the  inputs  in  accordance  with  the  integration  algorithm  to  get  a  fault  tolerant 
output  Ip. 

Case  (1).: 


Ij  n  Ik  =  <[)  V  j  k,  1  <  j,  k  <  3. 

Here  the  function  H;j  +  Hy2  has  its  support  as  a  reliable  estimate  of  the 

possible  region  of  the  value  of  Ip  :  the  output  abstract  estimate.  The  support  is  the  union 
of  the  disjoint  intervals  II,  I2, 13  (Figure  A.3). 

Hj^(x)  =  (x)^Hi2  (x)  +  h^3  (x) 
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Figure  A.3  Support  function  characterization  for  the  union  of  disjoint  intervals. 

Case  (2) : 

( Ij  n  Ik  d  for  some  l<j,  k  <3)  a  (Iin  I2  n  I3  =  p) 

Here  the  function  Hjj  +  has  its  support  as  a  reliable  estimate 

of  the  possible  region  of  the  value  of  Ip.  The  support  is  the  overlap  regions  of  the  disjoint 
intervals  II,  I2, 13  (Figure  A.4). 
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Figure  A.4  Support  function  characterization  of  overlap 
regions  of  the  disjoint  intervals  I  ^  I  o,  la- 

Case  (3)  ; 


Iin  I2  nl3  d- 

Ffere  the  function  h^2  has  its  support  as  a  reliable  estimate  of  the  possible 

region  of  the  value  of  Ip.  The  support  is  the  common  overlap  region  of  the  intervals  II,  I2, 
I3  (Figure  A.5). 
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However,  it  is  desirable  to  have  just  one  function  subsuming  the  above  three  cases 
so  that  the  job  of  checking  for  the  kind  of  overlap  of  intervals  can  be  avoided.  To  effect 
this  computationally  we  define  a  selector  function  f  on  functions  as  follows  : 


If  g  :  R  R  is  any  single  valued  real  function,  then  define  : 

f(  g(  X  ))  = 


0  V  X  G  R  if  g(x)  >  1  for  some  x  e  R 
1  V  X  e  R  if^f.t)  <  1  for  V  x  e  R 


This  function  chooses  the  zero  function  or  the  function  g(x)  at  hand  accordingly  as 
to  whether  g(x)  takes  value  greater  than  1  or  stays  strictly  less  then  or  equal  to  1.  We  need 
to  choose  in  cases  1,2  and  3  the  appropriate  function  constructed  under  the  case.  While 
amalgamating  these  functions  by  means  of  a  sum  we  need  to  suppress  the  other  two 
functions  since  they  are  irrelevant.  Hence  we  operate  the  selector  function  f  taking 
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advantage  of  the  fact  that  the  irrelevant  functions  are  themselves  zero  or  take  values 
greater  than  1. 

Now  define  as  follows : 


Ip  =  (x)  is  the  desired  output  abstract  sensor  estimate.  This  is  not  in  general  a 

1-interval  estimate,  but  gives  a  sharper  and  narrower  estimate  than  the  associated  single 
interval  estimate  in  the  algorithm.  We  however  can  obtain  a  single  interval  estimate  also 
by  filling  up  the  gaps  as  shown  later. 

The  total  interval  width  of  Ip  is  given  by 

— oo 

W(/p}=  ^H^^(x)dx 


We  generalize  these  ideas  and  results  to  obtain  a  functional  characterization  of 
the  combination  function  for  a  k-interval  abstract  sensor  estimate. 

A. 2  The  Combination  Function  for  k-Interval  Estimates 


Suppose  Ip  =  ilm[m=i  /  'vhere  1^  = 


a  ’  b  ^ 
“m '  ^  m 


,  1  <  j  <  k  are  the  proper  k-interval 


abstract  sensor  estimate  inputs  to  the  processor  p.  Ij  is  clearly  a  vector  of  k  components. 

s 

If  II, . . .,  Is  are  s  disjoint  intervals  and  I  =  ^  Ij,  then  define 

j=l 


I''  =  |at  +  P(1  - 1)  I  a  ,  P  €  I  and  t  e  [0, 1  ]| 


Thus  if 


'r 


a,,bj 


,  1  <  j  <  s  and  ai<bi<  ...  <as<bs,  then  1'^= 


,b 
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^  :  I  ^  can  be  called  the  filler  function  and  from  its  definition,  it  is  clear  that  it 
fills  up  the  gap  between  disjoint  intervals  to  give  one  single  connected  interval. 

We  now  carry  out  the  integration  of  the  vectorial  input  sensor  estimates  II  l2  and  l3 
componentwise.  The  componentwise  integration  is  the  same  as  the  1 -interval  integration 
except  that  we  use  the  filler  function  to  get  a  1-interval  output  estimate,  so  that  the 
vectorial  output  is  also  a  k  component  vector.  To  this  effect : 

Define  Ip  = 


where  I^  =  Suppj  H  jP 


=Supp] 


Thus  Ip  is  the  output  k-interval  estimate  where  the  integration  has  been 
performed  component-wise  following  the  example  of  the  1 -interval  estimates  discussed 
above.  The  width  vector  of  the  k-interval  estimate  output  Ip  is  given  by  : 


r 


\V(Ip)  = 


H  iP  (x)dx  ^ 

‘m 


k 

m=l 


J 


W(Ip)  is  a  vector  with  k-components  with  each  component  being  a  non-negative 
real  number  and  not  an  interval  as  in  the  case  of  II,  I2, 13  and  Ip  .  We  may  further  obtain  a 
scalar  measure  of  the  width  by  integrating  in  some  manner  desirable  (e.g.,  taking  a 
weighted  sum)  the  components  of  W(Ip) 

Thus  we  have  described  functionally  the  scheme  of  integration  of  the  abstract 
sensor  inputs  II  I2  and  I3  to  obtain  an  abstract  sensor  output  in  the  case  of  1-interval  and 
then  generalized  the  idea  to  k-interval  estimates  by  componentwise  integration.  This 
integration  is  in  accordance  with  the  integration  algorithm.  We  have  treated  here  the 
case  of  the  sensors  giving  a  uniformly  distributed  signal  over  the  associated  interval. 
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However,  if  the  sensor  can  weight  the  values  of  its  dense  output,  then  we  can  introduce 
the  appropriate  weight  function  in  place  of  products  of  Heaviside  function.  The  rest  of  the 
analysis  may  be  carried  out  in  an  analogous  manner  with  a  few  modifications.  The 
interval  width  measures  in  the  case  of  non-uniform  distribution  of  signal  values  are 
genuine  integrals  and  not  mere  measures  of  rectangular  areas  as  in  our  present 
discussions.  The  reason  for  writing  interval  width  measures  in  this  discussion  as  integrals 
is  to  suggest  this  generalization. 

The  number  of  inputs  considered  so  far  is  three  as  dictated  by  the  architecture  of 
the  sensor  network.  However,  our  functional  representation  is  easily  extendible  to  the 
case  when  the  architecture  is  not  just  binary  but  n-ary  at  each  processor. 
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LIST  OF  SYMBOLS 


A 


ep,  £ 
a 


Kp.K 

a 

Smin,  5max 
^min  j^max 

7 

CT 

li 

(ui  -  li) 


The  minimum  width  of  overlap  for  two  intervals  to  be  considered 
intersecting. 

Maximum  allowable  deviation  in  time  of  a  clock  on  a  PE. 

Maximum  allowable  deviation  in  time  of  clock  on  the  central  time 
server. 

Maximum  allowable  drift  rate  in  time  of  a  clock  on  a  PE. 

Mciximum  allowable  drift  rate  of  the  clock  in  the  central  time  server. 

Minimum  and  Maximum  channel  transmission  delay. 

Minimum  and  Maximum  delay  in  receiving  the  message  sent 
by  the  central  time  server  to  any  PE. 

Maximum  tolerance  in  time  that  a  node  can  tolerate  between 
intervals  that  can  be  integrated. 

Current  time  as  seen  by  a  processing  element's  clock. 

i^l’  abstract  estimate. 

Width  of  an  interval. 


